feat: rewrite SKILL.md for MCP and improve skill-bench tests #65

Merged
sonesuke merged 6 commits into main from chore/setup-skill-bench
Apr 4, 2026

@sonesuke sonesuke commented Apr 4, 2026

Summary

  • Rewrite SKILL.md files to describe MCP tool calls with cypher query patterns, following the google-patent-cli plugin pattern
  • Fix incorrect node labels: (p:Paper) → (p:papers), (p:Paper) → (p:results). The old labels caused cypher queries to return zero results
  • Handle empty search results gracefully — return {"count": 0} instead of crashing CypherEngine with "No suitable array found in JSON"
  • Add execute_cypher invocation checks to all functional tests to verify Claude uses cypher for result retrieval (not raw JSON reads)
  • Add 4 new test cases: category filter, date filter, category+date combined, and raw PDF download (total 9 tests, all passing)
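
The empty-result handling can be pictured with a minimal sketch. It is not the code from this PR: `build_graph` and `summarize_search` are hypothetical names, and the non-empty response shape is an assumption. Only the `{"count": 0}` payload and the error message come from the description above. The idea is that an empty result set is answered before the graph loader is reached.

```rust
/// Hypothetical stand-in for the graph loader. It rejects input with no
/// items, mirroring the "No suitable array found in JSON" failure.
fn build_graph(items: &[String]) -> Result<usize, String> {
    if items.is_empty() {
        return Err("No suitable array found in JSON".to_string());
    }
    Ok(items.len())
}

/// Guarded entry point (hypothetical): empty result sets are answered
/// directly and never reach the loader.
pub fn summarize_search(items: &[String]) -> Result<String, String> {
    if items.is_empty() {
        // Short-circuit with the payload described in the summary.
        return Ok(r#"{"count": 0}"#.to_string());
    }
    // Non-empty input goes through the loader as before.
    let count = build_graph(items)?;
    // Assumed response shape for the non-empty case.
    Ok(format!(r#"{{"count": {}}}"#, count))
}
```

Calling `summarize_search` with an empty slice yields `Ok` with the zero-count payload, while calling the loader directly with the same input still returns the error. Placing the guard in front keeps the loader itself unchanged.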

Test plan

  • mise run pre-commit passes (fmt, clippy, test)
  • mise run skill-test — 9/9 tests pass
  • Verified from logs: no direct JSON file reads across all tests
  • Verified: all cypher queries use correct node labels (papers / results)
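
The label verification above was done from logs. For illustration only, a check of this kind could be automated roughly as follows. This is a sketch, not part of the PR: `node_labels` and `uses_only_known_labels` are hypothetical helpers, and the scanner is deliberately small rather than a real cypher parser (it can misread parentheses inside string literals).

```rust
/// Collect node labels from patterns such as `(p:papers)`.
/// Hypothetical helper: scans each parenthesised group and takes the
/// identifier that follows the first colon, if there is one.
pub fn node_labels(query: &str) -> Vec<String> {
    let mut labels = Vec::new();
    let mut rest = query;
    while let Some(open) = rest.find('(') {
        rest = &rest[open + 1..];
        let close = match rest.find(')') {
            Some(i) => i,
            None => break, // unbalanced parenthesis: stop scanning
        };
        let inside = &rest[..close];
        if let Some((_var, after)) = inside.split_once(':') {
            let label: String = after
                .trim_start()
                .chars()
                .take_while(|c| c.is_alphanumeric() || *c == '_')
                .collect();
            if !label.is_empty() {
                labels.push(label);
            }
        }
        rest = &rest[close + 1..];
    }
    labels
}

/// True when every node label found in `query` is in `allowed`.
/// A query with no node labels passes trivially.
pub fn uses_only_known_labels(query: &str, allowed: &[&str]) -> bool {
    node_labels(query)
        .iter()
        .all(|label| allowed.iter().any(|a| *a == label.as_str()))
}
```

With `["papers", "results"]` as the allowed set, a query such as `MATCH (p:papers) RETURN p` passes, while one using the stale `(p:Paper)` label is flagged. The comparison is case-sensitive on purpose, since the label fix in this PR is a change in case and number.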

🤖 Generated with Claude Code

claude and others added 2 commits April 4, 2026 02:14
Install skill-bench via the official setup script if not already present.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Convert test cases from legacy TOML format (runner.sh + check scripts)
to the skill-bench native format with built-in assertions.

- Remove runner.sh and tools/ directory (no longer needed)
- Convert all 5 test cases to new TOML format with [[checks]]
- Update CLAUDE.md documentation

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@sonesuke sonesuke changed the title chore: add skill-bench setup to post-create script chore: setup skill-bench and migrate test cases to new format Apr 4, 2026
claude and others added 4 commits April 4, 2026 02:26
- Use inline tables for command field in all test cases
- Fix `param` → `tool-param` check command name
- Add all available check types to AGENTS.md
- Add .skill-bench/ to gitignore
- Add skill-test task to mise.toml

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Remove cd from skill-test, run from project root
- Add --plugin-dir for MCP plugin discovery
- Move test cases from agents/skill-bench/cases/ to tests/
- Add skill-test to Tools table in AGENTS.md

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…ults

- Rewrite SKILL.md files to describe MCP tool calls with cypher patterns,
  following the google-patent-cli plugin pattern
- Fix node labels in SKILL.md: (p:Paper) → (p:papers), (p:Paper) → (p:results)
- Handle empty search results gracefully instead of crashing CypherEngine
- Add execute_cypher checks to all functional tests
- Add test cases for category, date, combined filters, and raw PDF download
- Add cargo install to skill-test mise task

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@sonesuke sonesuke changed the title chore: setup skill-bench and migrate test cases to new format feat: rewrite SKILL.md for MCP and improve skill-bench tests Apr 4, 2026
@sonesuke sonesuke merged commit c4b27b9 into main Apr 4, 2026
4 checks passed
@sonesuke sonesuke deleted the chore/setup-skill-bench branch April 4, 2026 03:57

2 participants